Spatial Growth of Real-world Networks 
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Many real-world networks have properties of small-world networks, with clustered local neighbor- 
hoods and low average-shortest path (ASP). They may also show a scale- free degree distribution, 
which can be generated by growth and preferential attachment to highly connected nodes, or hubs. 
However, many real-world networks consist of multiple, inter-connected clusters not normally seen 
in systems grown by preferential attachment, and there also exist real- world networks with a scale- 
free degree distribution that do not contain highly connected hubs. We describe spatial growth 
mechanisms, not using preferential attachment, that address both aspects. 

PACS numbers: 89.75.Hc, 89.75.Da, 89.40.Bb, 82.30.Nr 
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I. INTRODUCTION 

Many real-world networks show small-world proper- 
ties P|. Their average clustering coefficient, represent- 
ing the proportion of direct links between the neighbors 
of a node, is higher than in same-size random networks, 
while they maintain a comparable average shortest path 
(ASP). The giant component of some of these networks 
has been shown to consist of several clusters, which con- 
tain strongly interlinked nodes and form only sporadic 
connections to other clusters. For instance, the corti- 
cal systems networks in macaque monkey and cat brains 
possess such a multi-cluster organization J^. Moreover, 
various complex linked systems have been described as 
scale-free networks 0)3) in which the probability for a 
node possessing k edges is P{k) oc k^'^. It has been sug- 
gested that this large class of networks may be generated 
by mechanisms of growth and preferential attachment, 
that is, the preferred linking of new nodes to already 
highly connected network nodes 01 ■ An essential aspect 
of many real-world networks is, however, that they exist 
and develop in metric space. Therefore, questions arise 
how nodes are able to identify highly connected distant 
hubs and why they would attach to them, rather than 
to nearby nodes [5j- Moreover, long-range connections 
to hubs violate optimal wiring principles For exam- 
ple, a city in New England would normally consider con- 
structing a new highway to nearby Boston, rather than 
to faraway Los Angeles, even if Los Angeles represents a 
larger hub in the US highway system. 

Previous spatial growth algorithms, in which the prob- 
ability for edge formation decreased with node distance, 

Eedetermined the position of all nodes at the outset 
18,]. Starting with the complete set of nodes, which 
was distributed randomly on a spatial grid, connections 
were established depending on distance ^ [ifl uM ■ Ad- 



ditionally, connected nodes could be drawn together by 
a posteriori pulling algorithm, which resulted in spatial 
clusters of connected nodes [ij . Such mechanisms, how- 
ever, appear unsuitcd as a general explanation for grow- 
ing biological and artificial systems with newly forming 
nodes and connections. 



II. SPATIAL NETWORK DEVELOPMENT 
ALGORITHM 



In an alternative approach, we employed a model of 
spatial growth in which the nodes, their positions and 
connections were established during development. Start- 
ing with one node at the central position (0.5; 0.5) of the 
square embedding space (edge length one), the following 
algorithm was used: 

1) A new node position was chosen randomly in two- 
dimensional space with coordinates in the interval [0; 1]. 

2) Connections of the new node, u, with each existing 
node, V, were established with probability 
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where d{u, v) was the spatial (Euclidian) distance be- 
tween the node positions, and a and (3 were scaling co- 
efficients shaping the connection probability . 

3) If the new node did not manage to establish connec- 
tions, it was removed from the network. In that way, 
newly forming nodes could only be integrated within the 
vicinity of the existing network, making the survival of 
new nodes dependent on the spatial layout of the present 
nodes. 

4) The algorithm continued with the first step, until a 
desired number of nodes was reached. 

Parameter /3 ("density") served to adjust the general 
probability of edge formation and was chosen from the 
interval [0; 1]. The nonnegative coefRcient a ("spatial 
range") exponentially regulated the dependence of edge 
formation on the distance to existing nodes. Such spatial 
constraints are present during the development of many 
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real networks. In biological systems, for instance, gradi- 
ents of chemical concentrations, or molecule interactions 
decay exponentially with distance jl^ . 

The algorithm allowed some nodes to be established 
distant to the existing network, although with low prob- 
ability. Subsequent nodes placed near to such 'pioneer' 
nodes would establish connections to them and thereby 
generate new highly-connected regions away from the rest 
of the network. Through this mechanism multiple clus- 
ters were able to arise, resulting in networks in which 
nodes were clustered topologically as well as spatially. 

In a slightly modified approach the growth model could 
employ a power-law to describe the dependence of edge 
formation on the spatial distance of nodes: 

P{u,v) ^ a d{u,v)-^. (2) 

By this mechanism the probability of establishing dis- 
tant nodes would be increased even further. For ex- 
ample, simulating networks of similar size (50 networks; 
n = 100; density = 0.04; square embedding space edge 
length 100) for both types of distance dependencies, the 
power-law (Eq. |21 a = 1, r = 1) resulted in higher 
total wiring length (6303) compared to networks gener- 
ated by exponential edge probability (Eq. ^ a = 0.35, 
/3 = 1, total wiring length 1077 units). In the following 
investigations, however, we concentrated on the expo- 
nential approach outlined above, since our simulations 
indicated that power-law edge probability was unable 
to yield small-world networks (tested parameter ranges 
cr e [0.004; 2] and r e [0.125; 64]). 

Another essential network feature investigated in the 
model was the presence or absence of hard spatial borders 
that limit network growth. Borders occur in many com- 
partmentalized systems, be it mountains or water sur- 
rounding geographical regions, cellular membranes sepa- 
rating biochemical reaction spaces, or the skull limiting 
expansion of the brain. Depending on coefficient a and 
the network size, our simulated networks never reached a 
hard border ('virtually unlimited growth'), or quickly ar- 
rived at the spatial limits, so that new nodes could then 
only be established inside the existing networks. Nat- 
urally, virtually unlimited growth would eventually also 
arrive at the hard borders, after sufficiently sustained 
network growth. However, in the context of our simula- 
tions, growth could be considered virtually unlimited if 
for a chosen network size at the end of the algorithm all 
nodes were still far away from the borders (by at least 
0.25 units). 

In the following, we describe different types of spatially 
grown networks resulting from low or high settings for 
parameters a and /3, and present examples of real-world 
networks corresponding to the generated types. 

For the generated networks, two network properties 
are shown, which have been used previously to charac- 
terize complex networks 0]. The average shortest path 
(ASP, similar, though not identical, to characteristic path 
length L Q) of a network with N nodes is the average 
number of edges that has to be crossed on the shortest 



path from any one node to another. 

^5P = — i— ^d(z,j) with I ^ J, (3) 

where d{i,j) is the length of the shortest path between 
nodes i and j. The clustering coefficient of one node v 
with ky neighbors is 

where |i?(r^)| is the number of edges in the neighborhood 
of V and ('^2") is the number of possible edges 0|. In the 
following analyses we use the term clustering coefficient 
as the average clustering coefficient for all nodes of a 
network. 

Algorithms for network generation, calculation of net- 
work parameters and visualization were developed in 
Matlab (Release 12, MathWorks Inc., Natick) and also 
implemented in C for larger networks. For each param- 
eter set and network size, 50 simulated networks were 
generated and analyzed (20 in the case of virtually un- 
limited growth, due to computational constraints). 

III. MODELED TYPES OF NETWORKS 

A. Sparse Networks (limited and virtually 
unlimited growth). 

For very small /3 (< 0.01), sparse networks were gener- 
ated (Fig. ^) in which only a small proportion of all pos- 
sible edges was established. The resulting networks were 
highly linear, that is, exhibiting one-dimensional chains 
of nodes, independent of limited or virtually unlimited 
growth (parameter a). The histograms of chain-lengths 
found in these networks, indicating the number of nodes 
in the chains, were similar to those of random networks 
with the same density. Unlike in random networks, how- 
ever, the clustering coefficient was lower than the net- 
work density, and despite lacking clusters and hubs with 
large degree fc, these networks possessed a power-law de- 
gree distribution, with high ASP (Fig. ^). The power- 
law exponent was small, in the range of [1.7; 2.1]; and 
in the simulated networks of 100 nodes the cut-off for 
the maximum degree of the scale-free networks was 16. 
Given their low maximum degree, these networks with 
low clustering and long linear chains of nodes could be 
called linear scale-free. 

Example: German Highway System. We identi- 
fied a linear scale-free organization in the German high- 
way ("Autobahn") system. The highway network of 
1,168 nodes was compiled from data of the 'Autobahn- 
Informations-System' |Q]. The ratio of clustering co- 
efficient and density of the highway system, which can 
be seen as a linearity coefficient, was 0.64. This system 
is also an example for a scale-free (7 = 2.8), yet not 
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(a) (b) 




k 

FIG. 1: (a) Sparse network (density 0.42%) with 500 nodes 
obtained by limited growth {a = 2, f3 = 0.001). (b) Cumu- 
lative degree probability P{k) that a node possesses k edges 
for the network shown in (a). A power- law of the degree dis- 
tribution (7 — 2.43) can be observed. 

small-world, network, as its ASP was twice as large as 
for comparable random networks. 

A similar type of organization was also found for scale- 
free protein-protein interaction networks [T^ (kmax ~ 
20). 

B. Dense Networks (limited and virtually 
unlimited growth). 

For higher edge probability (/3 — > 1), a noteworthy dif- 
ference between limited and virtually unlimited growth 
became apparent. While it was impossible to generate 
high network density under virtually unlimited growth 
conditions, the introduction of spatial limits resulted in 
high density and clustering, as well as low ASP. This 
was due to the fact that, in the virtually unlimited case, 
new nodes at the borders of the existing network were 
surrounded by fewer nodes and therefore formed fewer 
edges than central nodes within the network. In the lim- 
ited case, however, the network occupied the whole area 
of accessible positions. Therefore, new nodes could only 
be established within a region already dense with nodes 
and would form many connections. 

Figure 121 shows the relation between small-world graph 
properties and growth parameters a and f3 for networks 
consisting of 100 nodes. The ratio of the clustering co- 
efficient in spatial growth compared to random networks 
was larger than one (indicating small world graphs), if 
the values for a and (3 were high (Fig. ^.). The ASP in 
the generated networks normalized by the ASP in ran- 
dom networks with similar density was similar for low 
values of a and high values of f3. For these networks the 
likelihood of edge formation was high and — because of 
the low value of a — independent from spatial distance. 
Such networks resembled random growth, with the clus- 
tering coefficient possessing the same value as the density 

{C / Crandom ~ 1)- 

In a small interval of intermediate values for a (a w 4, 
/3 = 1), networks exhibited properties of small- world net- 
works (ASP and clustering coefficient shown in Fig. |3^). 



TABLE I: Comparison of cortical and simulated networks. 
Shown are the clustering coefficient CcorUcai of cortical net- 
works of cat and macaque with a given number of nodes n and 
density d as well as the clustering coefficient C spatial growth of 
generated networks with identical node number and similar 
density. 





n 


d 


C cortical 


C spatial growth 


cat 


55 


0.30 


0.55 


0.5 


macaque 


73 


0.16 


0.46 


0.4 



Here, the ASP was comparable to that in random net- 
works of the same size {ASP ~ ASPrandom), while the 
clustering coefficient was 39% higher than in random net- 
works 9, p. 114]. An overview of the parameter space 
and the resulting random, small-world, virtually unlim- 
ited or linear scale-free networks is given in Figure |3Jd. 

Example: Cortical Connectivity. One biologi- 
cal example for small-world spatial networks with high 
clustering coefficient and high density are the well stud- 
ied, clustered systems of long-range cortical connectiv- 
ity in the cat and macaque monkey brains 0, Il6l Il7| . 
We employed the model in order to generate networks 
with identical number of nodes and edges and compara- 
ble small-world properties. While small-world networks 
could be generated in the appropriate parameter range 
of the model (Fig. Ob), the biological networks fea- 
tured even stronger clustering. We found, however, that 
such networks could be produced by extending the local 
range of high connection probability, so that P = 1 for 
distancccat < 0.18, distancemacaque < 0.11 and P de- 
caying exponentially as before for larger distances (this 
was implemented by setting acat = 5, amacaque = 8 and 
for both networks /3 = 2.5 and thresholding probabilities 
larger than one to one). The modified approach therefore 
combined specific features of the biological networks with 
the general model of limited spatial growth. This yielded 
networks with distributed, multiple clusters, and average 
densities of around 30% (for simulated cat brain connec- 
tivity) and 16% (monkey connectivity). Moreover, these 
networks had clustering coefficients of 50% and 40%, re- 
spectively, very similar to the biological brain networks 
IJl, as shown in Table 

Comparison of the biological and simulated degree 
distributions, moreover, showed a significant correlation 
(Spearman's rank correlation p — 0.77 for the cat net- 
work, P < 3x 10"'^; and p = 0.9 for the macaque network, 
P < 2 X 10"^). On the other hand, the BA-model |3], 
using growth and preferential attachment, yielded simi- 
lar densities and clustering coefficients, but was unable 
to generate multiple clusters as found in the real cortical 
networks. 

In contrast to limited growth, virtually unlimited 
growth simulations with high (3 resulted in inhomoge- 
neous networks with dense cores and sparser periphery. 
It is difficult to imagine realistic examples for strictly un- 
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(a) (b) 




FIG. 2: Comparison of small-world properties of spatial and 
random networks for N=100 nodes. Each data point repre- 
sents the average for 50 networks, (a) Ratio of the clustering 
coefficient C of the generated networks divided by the clus- 
tering coefficient for comparable random networks. A large 
ratio is one feature of small-world networks, (b) Ratio of the 
average-shortest paths, ASP, of spatial-growth and compara- 
ble random networks. 



(a) (b) 
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FIG. 3: Exploration of model parameter space, (a) For dense 
networks {P — 1, N = 100 nodes), an increased dependence of 
edge formation on distance (parameter a) led to an increase 
of ASP (diamonds) and a decrease in clustering coefficient 
C (triangles), (b) Overview of network types for different 
spatial growth parameters (A'^ — 100 nodes) . Low values of a 
made edge formation independent from distance and resulted 
in random networks (black) . For large values of a only nodes 
near the existing network could establish connections, and the 
hard borders were not reached (virtually unlimited, green). 
The area labeled linear scale-free (blue) was a region in which 
sparse and highly linear networks showing a scale-free degree 
distribution occurred. Only a small part of the parameter 
space (red) showed properties of small-world networks. 

limited development, as all spatial networks eventually 
face internal or external constraints that confine growth, 
may it be geographical borders or limits of their ener- 
getic and material resources. However, virtually unlim- 
ited growth may be a good approximation for the early 
development of networks before reaching borders. 

IV. CLASSIFYING TYPES OF NETWORK 
DEVELOPMENT 

Different network growth types can be distinguished 
by assessing the evolution of network density and clus- 
tering coefficient. Growth with preferential attachment 



as well as spatial growth lead to clustering coefficients, 
C{N), that depend on the current size of the network, 
that is, the number of nodes, N (Fig. \^). While C{N) 
decreases with network size for networks generated by 
the BA- Model ^^J , it remains constant for spatial-growth 
networks. Virtually unlimited or limited spatial growth 
can thus be distinguished, since density decreases with 
network size for unlimited growth, while remaining con- 
stant for limited growth (Fig. 0Jd). 

Example: Evolution of metabolic networks. 
We applied this concept to classifying the development 
of real-world biological networks. The evolution of 
metabolic systems, for instance, can be seen as an in- 
corporation of new substances and their metabolic in- 
teractions into an existing reaction network. Reviewing 
43 metabolic networks in species of different organiza- 
tional level |l8:j, the clustering coefficient of these sys- 
tems remained constant across the scale pjj. whereas 
their density (Fig. Qt) decreased with network size. This 
indicated features of virtually unlimited network growth. 
The relation between the number of links and nodes in 
these systems was linear (Fig. 0Ji), with a slope of 5.2, so 
that the number of interactions of a metabolite was not 
increasing with network size. Such linear growth may en- 
sure that the metabolic systems remain connected (with 
the number of reactions larger than substances, as a nec- 
essary condition for connectedness), while not becoming 
too complex too quickly (as, for instance, with exponen- 
tial addition of new reactions). 



V. CONCLUSIONS 

We have proposed a new kind of spatial growth mecha- 
nism, incorporating both limited and virtually unlimited 
growth, that can produce a variety of metric real-world 
networks. The metric is not limited to Euclidian space 
as in the discussed examples, but may also use measures 
of similarity to define the link probability (e.g., social 
relations, |20l|). 

In contrast to previously studied spatial graphs |^, 
networks generated by our model were always connected. 
Moreover, the approach was able to generate small- world 
graphs, which is thought not to be possible in the spa- 
tial graph model in which positions are chosen randomly 
before edge formation |Q. Finally, the model was also 
able to produce scale-free networks with relatively low 
maximum degree, similar to, for example, the German 
highway system. 

A systematic evaluation of model parameter space was 
carried out at the specific network size of 100 nodes, 
which was feasible computationally. It would be inter- 
esting to also evaluate larger or smaller network sizes 
and to investigate for them, if small-world networks can 
be generated in a larger range of parameters a and f3. 

Several algorithms have been proposed for the gener- 
ation of different types of topological networks, in which 
links do not reflect physical distances, but merely the 
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FIG. 4: Comparison of the dependence of clustering coeffi- 
cient C{N) and density on network size (number of nodes, 
A'^). (a) For tlie simulated networks the clustering coefficient 
remained constant for limited (triangles, a = 5, /3 = 1) and 
virtually unlimited (boxes, a = 200, P — 1) spatial growth, 
but decreased for growth with preferential attachment (dia- 
monds), (b) Density was independent of network size only 
for limited spatial growth, (c) Density depending on network 
size (A'^) for the metabolic networks of 43 different organisms 
(15). (d) A critical measure for network development was 
the dependence of network size on the number of links. For 
metabolic networks, this relationship was strongly linear. 



connectivity of the system 0, 0, |^ • Examples for such 
networks include the World-Wide Web, financial trans- 
action networks, and, to some extent, networks of airline 
transportation. The present model extends previous ap- 
proaches to the development of spatial networks, such as 
cellular and brain connectivity networks, or food webs 
and many systems of social interactions. Spatial as well 
as temporal constraints shape network growth, and in- 
trinsic or external spatial limits may determine essential 
features of the structural organization of linked systems, 
such as clustering and scaling properties. Borders, for 
instance, appear to have been critical for early chemi- 
cal evolution, ensuring clustering of good replicators and 
preventing the sprea ding of short templates with limited 
replication function '22]. The same applies to cortical 
networks where elimination of growth limits results in a 
distorted network topology |2^ . 

The specific spatio-temporal conditions for the 
development of dilferent types of real-world networks 
warrant further investigation. They may be of additional 
interest, as local spatial growth mechanisms also imply 
glob )al optimization of path lengths in connected systems 
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